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Abstract 

Background: Serum carcinoembryonic antigen (sCEA) level might be an indicator of disease. Indeed, an elevated sCEA level 
is a prognostic factor in colorectal cancer (CRC) patients. However, the genetic determinants of sCEA level in healthy and 
CRC population remains unclear. Thus we investigated the genetic markers associated with elevated serum sCEA level in 
these two populations and its clinical implications. 

Methods and Findings: Genome-wide association study (GWAS) was conducted in a cohort study with 4,346 healthy male 
adults using the lllumina Omni 1 M chip. Candidate SNPs associated with elevated sCEA levels were validated in 194 CRC 
patients on ABI Taqman platform. Eight candidate SNPs were validated in CRC patients. The rsl 047781 (chr19- FUT2) (A/T) 
was associated with elevated sCEA levels, and rs81 76746 (chr9- ABO) was associated with the regional lymph metastasis in 
the CRC patients. The preoperative sCEA level was a risk factor for tumor recurrence in 5 years after operation (OR = 1.427, 
95% CI: 1.005-1.843, P = 0.006). It was also one of the risk factors for regional lymph node metastasis (OR = 2.266, 95% CI: 
1.196-4.293, P = 0.01 2). The sCEA level in rsl 047781 -T carriers was higher than that in the A carriers in CRC patients without 
lymph node metastasis (P = 0.006). The regional lymph node metastasis in patients with homozygote AA of rs81 76746 was 
more common than that in the heterozygote AG carriers (P = 0.022). In addition, rsl 047781 -AT and TT CRC patients 
exhibited a worse disease-free survival than AA genotype carriers (P = 0.023). 

Conclusions: We found candidate SNPs associated with elevated sCEA levels in both healthy males and CRC population. 
Rsl 047781 (chr19- FUT2) may be the susceptible locus for recurrence of CRC in a population from Southern China. 
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Introduction 

Colorectal cancer (CRC) is the fourth most common cancer in 
males and the third most common in females worldwide. It 
accounts for an estimated 1.2 million new cancer cases and over 
630,000 cancer deaths per year [1]. Incidence and mortality of 
CRC rose rapidly in China since the 1990s [2]. Sporadic CRC is 
the most common that occurs in the absence of family history, 
generally affects older population, and usually presents as an 
isolated colon or rectal lesion. Curative surgery is the most 
important approach of treatment. However, 5-year survival after 
resection is merely 60.8 % in China. Approximately 50% of these 
patients would undergo tumor relapse and die [3]. The prognosis 
was affected by factors such as cigarette smoking, age, gender and 
race, duration of symptoms, presence of bowel obstruction, tumor 
location, blood transfusion and the quality of surgical intervention 
[4]. 

Carcinoembryonic antigen (CEA) level might be elevated in the 
serum of people with non-neoplastic diseases, malignant tumors or 
cancers. Thus an elevated CEA level is an independent prognostic 
factor in CRC patients regardless of its Duke's stage and 
histological grade [5]. In the healthy population, serum CEA 
levels could be affected by environmental and genetic factors such 
as age, smoking, drug, disease, gender and race [6-8] . However, in 
patients with CRC, additional mechanisms determining serum 
CEA level remain unclear. 

Genome mutations associated with the elevated CEA level in 
CRC is present in the tumor itself, so such mutations could not be 
served as a genetic marker for CEA level for diagnostic purposes 
[9] . Therefore, this study was conducted in two phases: at phase 
one, a genome-wide association study was conducted to identify 
the loci associated with the serum level of CEA in healthy Chinese 
men from a cohort that participated in the Fangchenggang Area 
Male Health and Examination Survey (FAMHES) (this part has 
been merged with other cohorts and partly published elsewhere 
[10], [1 1]). At phase two, the association between candidate SNPs 
found from phase one GWAS and preoperative CEA levels and 
prognosis of patients with CRC was analyzed. Based on this 
design, we tried to explore in this study the association between 
polymorphisms and the prognosis of CRC after radical operation 
on account of susceptible loci associated with elevated sCEA level. 

Methods 

Phase one study 

A genome-wide association study was conducted to identify the 
loci associated with the serum level of CEA in healthy Chinese 
men from FAMHES cohort (The experiment and results were 
described in Text SI, Tables SI, S2, S3, and Figure SI). 

Phase two study 

Patients with sporadic colorectal cancer. One hundred 
and ninety-four colorectal carcinoma patients who met the criteria 
for enrollment were recruited from patients undergoing curative 
resection between March 2008 and February 2012 at the 
Department of Anal and Colorectal Surgery, I s Affiliated Hospital 
of Guangxi Medical University. All tumors in the study were 
adenocarcinoma, and the pathological diagnoses were confirmed 
independendy by two expert gastrointestinal pathologists. All 
patients were absent of symptoms such as severe bleeding, bowel 
obstruction or infection. None of them had any preoperative 
radio-chemotherapy. All patients were treated with postoperative 
chemotherapy protocol based on NCCN Clinic Practice Guideline 



in Oncology. Written informed consent was required from all 
patients. They were followed up to 51 months, and by the end 
there were 57 patient with tumor recurrence (median survival: 48 
months; range: 7^62 months). This study was approved by the 
ethics committee of Guangxi Medical University. Baseline 
characteristics of participants were summarized in Table S4. 

Genotyping of candidate SNPs in CRC patients. Nine 
candidate SNPs from the 25 SNPs that reached genome-wide 
significance were selected for genotyping in CRC patients: 
rs8176746 (chr9- ABO), rsl047781 (chrl9-FUT2), rs3760775 
(chrl9-FUT6), rs441810 (chr21-FAM3B), rsl2608544 (chrl9- 
DBP), rs3786749 (chrl9- SULT2B1), rs2071699 (chrl9- FUT1), 
rs507666 (chr9- ABO) and rs687289 (chr9- ABO). 

DNA was extracted from the colon mucous membranes using a 
DNA Extraction Kit (Tiangen, China). PCR platform (STEPONE 
PLUS A TM, Applied Biosystem. Foster city, CA) was used to 
conduct allele's genotyping. Automatic SNP genotyping was run 
by StepOne Plus A TM version 2.2 system (Applied Biosystem. 
Foster city, CA); the quality control criteria used were the same as 
that in GWAS. The details of experiment are described in Text SI 
and Table5. 

Chemotherapy. According to the risk for post-operation 
recurrence, some patients at stage Hand all stage III patients 
received at least one cycle of Oxalipatin, Fluorouracil, and 
Leucovorin (FOLFOX regiment) as adjuvant treatment in our 
study. Patients at age ^70 years did not receive adjuvant 
treatment. Adverse effects were assessed using the Common 
Toxicity Criteria of National Cancer Institute version 2.0. 
Chemotherapy was stopped when there was cardiac or neurocer- 
ebellar adverse effects or grade 3 or 4 allergic reaction. 

Follow-up. Patients in our study were assessed every six 
months after operation. The recurrence was determined on the 
basis of physical examination, chest radiography, abdominal 
ultrasonography or computer tomography, colonoscopy, CEA 
level detection, and if necessary, cytologic analysis or biopsy. The 
cut-off date of the analysis was June 20, 2013; duration of follow- 
up was defined as the number of months from operation to the 
cut-off date. The disease-free survival was measured from the time 
of operation to the recurrence of tumor. 

Statistical analyses. Nonparametric statistics was used in 
univariate analysis. Multivariable models were constructed by 
Logistic regression analysis. Kaplan-Meier estimators were applied 
in calculating survival function; the log-rank test was used to 
determine the survival differences among individual groups. The 
SPSS v. 17.0 program was used for analysis and a P value <0.05 
was considered statistically significant. 

Results 

Factors affecting sCEA levels in healthy population 

In General Linear Regression model, Age (Beta = 0.003, P< 
0.0001), total quantity of cigarettes (Beta = 0.004, P<0.001) and 
BMI (Beta = -0.005, P = 0.006) was significantly associated with 
CEA level (Table 1). R 2 of this model was 0.07 (F= 51.834, 
ANOVA). 

Genotyping in CRC patients 

The candidate SNPs used to genotype for 194 CRC patients 
with adenocarcinoma and the genotyping results were listed in 
Table S6. 
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Table 1. Multiple linear regression analysis of the factors affecting sCEA levels in FAMHES participants. 





Variables 


B 


Beta 


P 


95% C.I for B 


Adjusted R 2 


(constants) 


1.365 


0.043 


<0.0001 


0.19-0.357 


0.070 


Age 


0.003 


0.147 


<0.0001 


0-0.004 




Total quantity of cigarettes 


0.004 


0.175 


<0.0001 


0-0.005 




BMI 


-0.005 


-0.06 


0.006 


-0.008 - 0 





B: unstandardized coefficients; Beta: standardized coefficients. Data of CEA values had been log-transformed for normal distribution. Regression model (F-value of 

ANOVA: 51.384; p<0.01). 

doi:1 0.1 371 /journal.pone.0097923.t001 



Risk factors for elevated CEA levels in CRC patients 

The univariate analyses revealed that the preoperative CEA 
level was significantly different by status of regional lymph nodes 
and distant organ metastases (positive vs. negative: 5.36 vs. 
3.41 ng/ml; P = 0.011; 11.87 vs. 3.60 ng/ml; P = 0.033. respec- 
tively). The preoperative CEA level was also not associated with 
factors such as age, gender, cigarette smoking, alcohol drinking, 
tumor size and cell differentiation (Table 2). 

In patients without regional lymph nodes metastasis, subgroups 
with rsl047781 (chrl9- FUT2) AT and TT aUele had a 
significantly higher CEA level than the AA carriers 
(AA: 1.7 3 ng/ml, AT: 3.49 ng/ml and TT: 4.11 ng/ml; 
P = 0.006). However, the difference was not significant in the 
CRC patients with lymph node or distant organ metastasis (AA vs. 
AT: P = 0.686, AA vs. TT: P = 0.202, AT vs. TT: P = 0.139, 



respectively). The J-T test showed that the TT allelic subgroup 
had a higher sCEA level while the AA had the lowest (P = 0.01 1). 
Other candidate SNPs were not associated with CEA levels in 
CRC patients (Table 3). For rsl047781 (chrl9- FUT2), the allelic 
genotype TT was associated with the highest sCEA level, and the 
difference in the sCEA level between allele T carriers and 
homozygote AA was significantly different (AA vs. AT: P = 0.008; 
AA vs. TT: P = 0.006; AT vs. TT: P = 0.787; respectively) 
(Table 4). 

Risk factors for tumor metastasis in CRC patients 

The rate of regional lymph node (RLN) metastasis in the allelic 
subgroup AT was the highest in the three subgroups of rs 1047 781 
(AA: 12.8%, AT: 19.6%, TT: 9.0%; AA vs. AT: P= 0.961, AA vs. 
TT: P = 0.046, AT vs. TT: P<0.001; respectively). The rates of 



Table 2. Univariate analysis 


of factors affecting the preoperative CEA levels 


in CRC patients. 






Variables 




n 


sCEA Level (ng/ml) 1 


P value 


Age (yrs.) 


£60 


93 


3.30 


0.077 




<60 


96 


4.34 




Gender 


Male 


117 


3.67 


0.689 




Female 


72 


4.19 




Cigarette smoking 


Yes 


45 


4.29 


0.606 




No 


145 


3.49 




Alcohol drinking 


Yes 


41 


4.61 


0.202 




No 


149 


3.49 




Tumor size(cm} 2 


==5 


94 


3.52 


0.194 




<5 


94 


4.66 




Tumor Differentiation 


Well 


9 


3.43 


0.719 




Moderately 


153 


3.80 






Poorly 


21 


3.49 




Distant organ metastasis 


Yes 


16 


11.87 


0.033 




no 


174 


3.60 




Regional lymph nodes metastasis 


yes 


77 


5.36 


0.011 




no 


112 


3.41 




Depth of tumor invasion 


I 


1 




0.129 




II 


44 


2.26 






III 


60 


3.54 




IV 1 



'The CEA level was represented by median. 

2 The tumor size was represented by the max diameter of primary tumor. The patients with regional lymph nodes metastases or distant metastases were excluded in this 
analysis. 

doi:1 0.1 371 /journal.pone.0097923.t002 
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Table 5. Logistic regression 
operation. 


analysis of the risk factors influencing reg 


onal lymph nodes with tumor metastasis before radical 




variables 


B 


S.E 


OR 


95%C.I. for OR 


P 


T stage 


0.613 


0.218 


1.847 


1.203-2.835 


0.005 


Preoperative CEA levels 


0.818 


0.326 


2.266 


1.196-4.293 


0.012 


Ages 


-0.880 


0.326 


0.415 


0.215-0.801 


0.009 


constant 


-2.198 


0.365 


0.111 




0.001 



Variables in the equation included age, gender, tumor size, tun 
doi:1 0.1 371 /journal.pone.0097923.t005 



distant organ metastasis in the three allelic subgroups of rsl047781 
were not significantly different (AA vs. AT: P = 0.65 1 ; AA vs. TT: 
P = 0.607; AT vs. TT: P = 0.888, respectively) (Table 4). The 
frequency of RLN metastases was significantly different in the 
allelic subgroups of rs8 176746 (chr9-ABO) (P = 0.022). 

Risk factors for disease-free survival in CRC patients 

Logistic regression analysis revealed that age, preoperative CEA 
level and T stage (tumor invasion) were risk factors for regional 
lymph nodes metastasis (Table 5). The preoperative CEA level 
was also the risk factor for tumor recurrence in 5 years after radical 
operation (Table 6). The probability of survival was higher in the 
group of regional lymph nodes negative than those positive for 
metastasis, and was lower in the group with a preoperative CEA 
level of S5 ng/ml than those with <5 ng/ml. The differences 
were statistically significant (Figure 1, 2). SNPs associated with 
the CEA level and regional lymph metastasis, rsl047781 (chrl9- 
FUT2) and rs8 176746 (chr9- ABO), were not the independent 
contributors to disease-free survival (Figure 3, 4). We used the 
combination of SNPs (rsl047781 and rs8176746) as the genetic 
marker in survival analysis and the disease-free times was 
significantly different between rsl047781-AA and rsl047781 
(AT+TT) genotype carriers among rs8 176746 (AA+AG) carriers 
(P = 0.023) (Figure 5). However, the difference was not determined 
among the rs8176746-GG carriers. 

ABO genotypes and serum CEA levels 

The distribution of CEA levels by ABO blood allelic type was 
significantly different in healthy people (BB>BO>AB>00> 
AO>AA, P= 1.47E-26). Although it is not statistically significant 



(P = 0.458), we also found a similar trend in the CRC patients (B> 
A) (Figure 6, 7). 

Discussion 

Factors affecting Serum CEA levels in the healthy 
population 

In support of a previous report on healthy volunteers in the 
United States, we found that cigarette-smoking is associated with 
CEA levels [12]. Furthermore, when the smoking status is further 
quantified by the amount of cigarettes (pack per year), a trend in 
the association between smoking grading and CEA levels was 
statistically significant. Long term exposure to carcinogens from 
cigarette-smoking contributes to a variety of malignancies, such as 
non-small- cell lung cancer, pancreatic cancer and oral cancer 
[13], [14]. In addition, some studies have reported that an elevated 
CEA level associated in such malignancies [15] — [17]. Therefore 
the elevated CEA level might reflect the risk for tumor. 

It has been reported that CEA levels increase with aging in 
France and Netherland populations [18], [19]. However, the trend 
for higher CEA levels in elders does not necessary correlate with a 
higher risk for non-neoplastic diseases or cancer [20] . Our finding 
of the association between elevated CEA level and aging in men 
from the Fangchenggang area support the notion that age is an 
independent factor influencing the CEA level in spite of the 
disparity of population and area. Interestingly, in male smokers, 
the contribution of age to CEA levels was attenuated. 

We found in this study a negative correlation between BMI and 
CEA level in the healthy male Chinese. It is therefore necessary to 
consider BMI when conducting GWAS in this population. 



Table 6. Logistic regression analysis of risk factors influencing 5-yr recurrence in CRC patients. 



Variables B S.E OR 95%C.I. for OR p 



Tumor site 


-0.492 


0.317 


0.612 


0.329-1.138 


0.121 


Macropathological type 


-1.122 


0.709 


0.326 


0.081 -1.307 


0.114 


Cell differentiation 


-0.433 


1.108 


0.649 


0.074-5.696 


0.696 


Lymph node metastasis 


1.103 


0.966 


3.014 


0.454-20.002 


0.253 


Chemotherapy 


-0.81 1 


1.070 


0.414 


0.051 -3.373 


0.410 


Tumor size 


1.361 


0.933 


3.898 


0.626-24.276 


0.145 


Gender 


0.950 


0.890 


2.586 


0.452-14.801 


0.286 


Age 


0.013 


0.03 


1.013 


0.956-1.073 


0.670 


Preoperative CEA levels 


0.356 


0.131 


1.427 


1.005-1.843 


0.006 


Constant 


-0.350 


3.457 


0.704 




0.919 



'The patients with distant organ metastases were excluded. 
doi:1 0.1 371 /journal.pone.0097923.t006 
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Figure 1. Tumor-free survival analysis of CRC patients stratified by regional lymph nodes. The tumor-free time was different between 
CRC patients with and without lymph node metastases (p<0.001). 
doi:1 0.1 371 /journal.pone.0097923.g001 



Similarly, Park JS et al. reported that increased BMI is associated 
with lower CEA concentrations in CRC patients [21]. Epidemi- 
ological studies have also found an association between obesity and 
colorectal cancer [22]. In vitro studies have revealed that the 
adipocyte could promote the growth of several colon cancer cell 
lines [23]. Therefore it has been hypothesized that although the 
CEA is produced by epithelial cells, adipose tissue could affect the 
secretion of CEA in other tissues via various cytokines and 
adipocytokines. 

HBV infection is associated with the risk of hepatocellular 
carcinoma [24]. However, there is no direct evidence of 
association between HBV infection and digestive malignancies 
such as pancreatic cancer and gastrointestinal cancer [25]. In this 
study, statistical analysis showed that there was no significant 
difference in the serum CEA levels between groups that are 



positive or negative for HBsAg. Therefore we did not exclude the 
samples positive for HBsAg from the GWAS analysis. An 
increased level of CEA has been found in patients with alcoholic 
liver diseases or patients with disorders of respiratory and/or 
gastrointestinal tracts [26]-[28]. However, the association between 
alcohol-drinking and CEA levels was not conclusive in this study. 
Therefore, alcohol-drinking is not a justification factor in the 
GWAS analysis. It has been reported that the serum levels of CEA 
varied among different ethnic-groups [29]. To exclude ethnic- 
specific genetic factors, we also limited the number of Han 
participants in GWAS. 

Of note, we found that age, cigarette smoking and obesity to be 
the epidemiology factors affecting serum CEA levels. Nevertheless, 
the power of the lifestyle or environmental factors in the model 
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Figure 2. Tumor-free survival analysis of CRC patients stratified by preoperative sCEA levels. The tumor-free time was different between 
CRC patients with preoperative CEA levels at <5 and >5 ng/ml (p = 0.001). 
doi:1 0.1 371 /journal.pone.0097923.g002 



was weak because the coefficient of determination was merely 
0.070. 

Genetic variant associated with CEA level in healthy 
populations 

There were 25 SNPs associated with CEA levels at the genome 
wide level based on the analysis using the Multiple Linear 
Regression model. The biological functions of these SNPs were 
listed in Table S8 and genes involved (except for rs81 1 1500) were 
FUT1, FUT2, FUT3, FUT6, ABO, DBF, CA11, RPL18, SULT2B1, 
and FAM83E. The genes FUT1, FUT2, FUT3, FUT6, DBP, CA11, 



RPL18, FAM83E are located in regions 19pl3.3 and 19ql3, 
whereas ABO is located in 9q34.2. FUT1, FUT2, FUT3, FUT6 
and ABO are genes involved in the biosynthesis pathway of ABH 
and Lewis histo-blood group antigens. The five genes encode 
glycosyltransferases that catalyze the stepwise addition of specific 
monosaccharide unit in the biosynthesis process. 

The strongest CEA level-associated SNP was rsl047781 at 
19ql3, a missense change (A->T, Ile->Phe) within the FUT2 
gene that encodes a fucosyltransferase catalyzing the production of 
the H antigen and determines the secretion status of the ABO and 
Lewis histo-blood group antigens. It was reported that the T allele 
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Figure 3. Tumor-free survival analysis of CRC patients stratified by rs1 047781. With rsl 047781 (chr-19 FUT2), the disease-free time of CRC 
patients grouped by allelic genotypes (AA, AT and TT) was not significantly different (p = 0.066). 
doi:1 0.1 371 /journal.pone.0097923.g003 



of this SNP represents a de facto inactivated mutant of the FUT2 
enzyme and was causally associated with the non-secretor 
phenotype in a Japanese population [30]. But the direct 
experimental evidences supporting the biology association be- 
tween various forms of ABO, Lewis histo-blood group antigens 
and CEA levels are absent. Some studies have provided indirect 
evidences that the A antigen moiety coded by ABO is present on 
CEA molecules [31], and that the determinants of A, B and H 
antigens and of CEA shared the same glycoprotein carrier 
molecules [32]. Thus it may be speculated that the genetic 
polymorphisms of various CEA antigen-synthesizing glycosyltrans- 
ferases may modify the sugar groups of the CEA molecules, 
leading to varying stabilities of CEA molecules in the blood, or 
alternatively resulting in different detection rates by the given CEA 
measurement method. 



We found eight SNPs located in the ABO gene, encoding an 
acetyl-galactosammyltransfer- ase that determines the ABO blood 
types. Based on the data from Seattle SNPs (http:/ /pga.mbt. 
washington.edu) as well as the Blood Group Antigen Mutation 
Database (www.ncbi. nlm.nih.gov), an associated study revealed 
that rs507666 is a surrogate for type Al histo-blood group antigen, 
rs687289 is a marker for the O allele, rs8 176746 is a marker for 
the B allele and rs8 176704 is a marker for the A2 aUele [33]. This 
interesting finding might support the association between CEA 
levels and ABO gene in CRC patients; this result is shown in 
Figure 6 and 7. 

Furthermore, the ABO histo-blood group type has been 
associated with a series of diseases characterized by elevated level 
of serum CEA, such as Helicobacter pylori infection, cancer of the 
gastro-intestinal tract, breast cancer, and pulmonary cancer [34], 
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Figure 4. Tumor-free survival analysis of CRC patients stratified by rs81 76746 genotypes. With rs81 76746 (chr-9 ABO), the disease-free 
time of CRC patients grouped by allelic genotypes (AA and AG) was not significantly different (p = 0.366). Subjects with genotype GG were excluded 
(n = 2). 

doi:1 0.1 371 /journal.pone.0097923.g004 



[35]. The ABO blood type group exhibiting a high level sCEA 
might suggest a high risk for these diseases in the ABO carrier 
population [36], [37]. Because of the lack of knowledge of CEA 
molecular synthesis, it is hard to do further bioinformatics analysis 
in addition to that for ABO and FUT genes. 

The purpose of our phase one study was to find loci to be used 
to select candidate loci for SNP genotyping in CRC patients. We 
inferred that SNPs were associated with elevated sCEA levels in 
healthy and CRC population from the same region. Therefore 
candidate SNPs were determined based on rank of genome-wide 
P-value and location of genes relevant to ABO blood type. 
Rs8176746, rs376077, rsl2608544, rs3786749, rs2071699, 
rs441810 and rs507666 were subsequently included in the 
genotyping of CRC patients. 



Genetic variation associated with serum CEA levels in 
CRC patients 

In order to estimate the heterogeneity, potential non-genetic 
confounders affecting CEA levels in healthy populations were 
analyzed in CRC patients at first and revealed that the status of 
lymph node metastasis was a significant contributor. Stratification 
analysis showed that rsl047781 (chrl9- FUT2) was the associated 
loci in CRC patients. This suggests that there might be 
confounders in addition to the status of lymph node metastasis. 
In accordance with the results from the GWAS analysis in healthy 
population, the rs 1047 781 was a susceptible locus for elevated 
CEA level in CRC patients. In healthy population, the CEA level 
of AA genotype was the lowest and TT was the highest (data have 
been adjusted for confounders). Similarly, genotype AA carriers 
had the lowest CEA levels in the CRC patients without tumor 
metastasis. The allele T (thymine) to A (adenine) in rsl047781 is a 
missense mutation. The other study in FAMHES showed 
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Figure 5. Tumor-free survival analysis of CRC patients stratified by rs1047781+rs81 76746 genotypes. Subjects were divided into 2 
groups by genetic maker (rs1047781+rs81 76746). Group 1 was rs1047781 (AA) +rs8176746 (AA+AG) and group 2 was rs1047781 (AT+TT) +rs8176746 
(AA+AG). The disease-free time of subgroups was significantly different (p = 0.023). 
doi:1 0.1 371 /journal.pone.0097923.g005 



rs 1047 781 to be a locus associated with vitamin B12 levels in 
healthy people [38]. However, this was not determined in CRC 
population. So far, there has been no convincing evidence about 
Vitamin B12 and any other lifestyle factors affecting CRC 
recurrence and survival [39]. An in vitro experiment showed that 
the CDX2 gene is associated with the increase in sLe x/a expression 
and suppression of FUT2 in colon cancer cells during EGF/ 
bFGF-induced epithelial-mesenchymal transition (EMT) [40]. 
EMT is a critical phenotypic alteration in metastatic cancer cells. 
We observed that AT and TT genotypes of rsl047781 (FUT2), 
and not the AA genotype, are associated with higher serum CEA 
levels. Interestingly, the CRC patients with T allele exhibited a 



different prognosis from those with the AA genotype. The results 
suggest that CDX2 might be inactive in allele T carriers, resulting 
in a difference of expression between FUT2 and CEA. The allele 
T carrier might be susceptible to tumor metastasis at the early 
stage of CRC and to tumor recurrence in the future. It was 
reported that (31, 3- Galactosyltransferase is down-regulated in 
colon adenocarcinomas and causes CEA express with absence of 
type 1 chain [41], but the relation between FUT2 and CEA was 
still unknown. Therefore laboratory evidence is needed to prove 
whether this polymorphism is biologically functional [42]. 



PLOS ONE | www.plosone.org 



11 



June 2014 | Volume 9 | Issue 6 | e97923 



SNPs Associated with CEA Levels and CRC 



3.50- 



~ 3.00- 
1 

g 2.50- 



> 
< 

U 

E 

3 
u 

<U 

(0 



2.00- 



1.50- 



1.00- 



0.50- 



0.00- 



AA 

(n=208) 



AO 

(n=582) 



00 

(n=1268) 



AB 

(n=286) 



BO 

(n=553) 



BB 

(n=83) 



ABO genotypes 



Figure 6. Distribution of serum CEA levels by ABO allelic genotypes in the healthy population. The CEA levels of male FAMHES subjects 
grouped by ABO groups genotypes were significantly different (p = 1 .47E-26). 
doi:1 0.1 371 /journal.pone.0097923.g006 



Risk factors for preoperative CEA levels and tumor 
recurrence in CRC patients 

The age, cigarette smoking, alcohol drinking and obesity were 
confounders for the serum CEA levels in healthy people. Among 
CRC patients, the status of metastasis in regional lymph nodes and 
distant organs were main confounders of preoperative CEA levels 
in our study. To test if tumor size is a confounder [43], in our 
study, the association between tumor size and pre CEA levels was 
examined yet the association was not significant (see Figure S2). 
BMI is affected by advancing age more than 20 yrs old and it is 
not an independent variety in our analysis [44]. Literatures 
reported that age, obesity, alcohol intake and cigar smoking were 
independent risk factors for the prognosis of CRC. Age (especially 



^75 yrs.) is the important risk factor for patients with CRC 
regardless of the type of therapy and grade of tumor stage [45] . 

For patients who underwent curative resection, the prognosis of 
elderly patients (&75 yrs.) was also worse than the younger 
(25 — 75 yrs) [46] . However, we did not find this association in our 
population. Prospective studies suggested that cigarette smoking is 
associated with the mortality of CRC [47] . Proposed mechanism 
of this pathogenesis is through the DNA damage caused by 
carcinogens of cigar smoke [48,49]. Obesity and the accompany- 
ing chronic systemic inflammation and metabolic alteration are 
risk factors for CRC [50]. It was reported that gender is also a 
potential factor affecting the prognosis of CRC but only in patients 
at age <40 yrs [51]. Thus we considered that it is inclusive and 
did not take it as a confounder in relevant analysis. 
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Figure 7. Distribution of serum CEA levels by ABO allelic genotypes in CRC patients. The serum CEA levels of patients with CRC grouped 
by blood group allelic genotypes were represented by median and there were not significantly different (x 2 = 3.63, p = 0.458). Geneotype AO was 
absent; A1 and A2 were included in A allelic type. The ABO blood type was determined before operation. 
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Genetic variant associated with tumor recurrence in CRC 
patients 

The elevations of CEA levels in healthy and CRC populations 
might suggest biological relationship between synthesis of ABH 
antigen and CEA. Although the preoperative CEA levels were 
associated with ABO allelic genotypes and tumor recurrence, the 
disease-free times of groups with blood types A and O were not 
significantly different in survival analysis (Figure S3). This result 
also suggested that the ABO blood types were not biomarkers of 
tumor recurrence prediction. 

Logistic regression model showed that the preoperative CEA 
level, age and depth of tumor invasion were risk factors influencing 
RLN metastasis. Also, the preoperative CEA level was a significant 
factor contributing to the risk of tumor recurrence in 5 years. It is 
therefore interesting to explore the association between sCEA- 
relevant SNPs and lymph node metastasis. As shown in Table 3, 



rsl047781-TT carriers had lower CEA levels and fewer RLN 
metastases, supporting the association between CEA levels and 
prognosis. Since rs8 176746 was associated with lymph node 
metastasis, the combination of rsl047781 and rs8176746 as 
genetic markers of disease-free survival was applied for further 
analysis. As suggested by Figure 3, rs 1047 781 (AT+TT) carriers 
had worse disease-free survival than the AA carriers. It is in 
accordance with the higher CEA levels among mutant type 
rsl047781 carriers. 

Although these SNPs might associate with tumor recurrence in 
CRC, the overall survival of CRC patients was not significantly 
different in 5 years when grouped by rs 1047 781 or rs8 176746 
(Figure S4, S5). This suggests that overall survival of CRC patients 
was affected by other factors excepted for tumor recurrence. 
Therefore it is worthwhile to test if rs 1047 781 is a prognostic 
marker of tumor recurrence among CRC patients. 
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Overexpression of the CEA gene (and CEACAM6] has been 
found in a series epithelial malignancies including CRC [52], [53]. 
This could disrupt the colonocyte differentiation and promote 
colon carcinogenesis [54]. Higher preoperative sCEA levels were 
associated with higher risk of CRC recurrence after operations. 
Thus the preoperative sCEA level could be served as an 
independent prognostic factor [55], [56], [57]. The mutation in 
the tumor cell's CEA gene (PELPK motif; Pro-Glu-Leu-Pro-Lys) 
might induce high CEA levels by its hepatic clearance; mutation in 
the PELK motif could alter structural stability and binding affinity 
to Kupffer's cell receptor in the liver [58] . In addition, this binding 
results in the release of a series of cytokines that have the potential 
to activate hepatic sinusoidal endothelium [59]— [6 1] . These 
observations imply that the CEA could be a facilitator of hepatic 
metastasis, although direct association between CEA level and 
lymph node metastasis is absent. 

Another study indentified rsl0318 (GREM1), rs6983267 
(POU5F1P1, DQ515897, MYC) and rs4464148 (SMAD7) asso- 
ciated with clinic outcome in patients with stage II and stage III 
CRC treated with 5-FU-based adjuvant chemotherapy [62]. 
GREM1, BMP2 and SMAD7 are signaling molecules in the TGF- 
(3 pathway. Overexpression of TGF-fS signaling genes has been 
found in the progression of CRC [63]. Given the complex process 
of CRC recurrence and metastasis, this finding suggested there are 
other genetic risk factors. Interestingly, it was reported that CEA 
could interact with TGF-(3 receptor and inhibits TGF-(3 signaling 
directly; even enhance the survival of colorectal cancer cells in 
local colonization in animal study [64]. 

Adjuvant therapy of CRC 

In our study, some CRC patients with stage II and all the 
patients with stage III received FOLFOX regiment as adjuvant 
treatment [65], [66]. However, chemotherapy itself was not 
associated with tumor relapse in the five years of follow-up. The 
benefit of adjuvant chemotherapy for patients with stage II CRC is 
still debatable [67]. As suggested by clinical trials the disease-free 
survival of 3 years follow-up might be appropriate index in 
patients with stage II CRC when treated by fluorouracil [68]. 
Thus the follow-up time was up to 4 years in our study. In a meta- 
analysis of adjuvant therapy and survival among patients with 
resected CRC, the longer time to adjuvant therapy was associated 
with worse survival [69]. It suggested that optimal timing of 
adjuvant therapy was an important factor and needed to be 
evaluated in analysis. 

Summary 

We reported in this study the genetic variations associated with 
CEA levels in CRC population. We found that the rsl047781 is 
the susceptible locus for elevated sCEA level whereas rs8 176746 is 
associated with the status of regional lymph node metastasis. The 
elevated sCEA level and status of lymph node metastasis are risk 
factors for tumor-recurrence after radical operation. The SNP 
rs 1047 781 might be a predictor for tumor recurrence in TNM 
stage II and III CRC patients. The contribution of rs8 176746 was 
not addressed in this study. 

A number of limitations of our study must be addressed: (1) In 
addition to CRC, elevated sCEA levels have been reported among 
cholangio-carcinoma, pancreatic carcinoma and breast cancer 
patients. Therefore, it is interesting to explore the influence of the 
candidate SNPs we found in this study in these malignancies. (2) 
Although candidate SNPs such as rs 1047 781 is located in the 
encoding region, its biological function is still unknown based on 
existing evidence. Therefore experiments exploring the regulatory 



mechanisms are definitely needed. (3) Because the samples 
included in our study were limited to Guangxi province, a region 
in southern China, we have yet to determine the significance of 
such associated SNPs in populations of other regions. 
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Figure S3 Survival analysis for disease-free times of 
CRC patients grouped by A and O blood type. The 

difference of disease-free times was not significandy different 

(p = 0.391). 
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significandy different (p = 0.084). 
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Figure S5 Overall survival of CRC patients grouped by 
rsl047781. In rsl047781 (chr-19 FUT2), the overall survival of 
CRC patients grouped by allelic genotypes (AA, AT +TT) was not 
significandy different (p = 0.165). 
(TIF) 
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(DOC) 

Table S6 Summary of the SNPs genotyping in CRC 
patients. There were 194 CRC patients with adenocarcinoma in 
analysis. 
(DOC) 

Table S7 Biological functions of associated SNPs. 

Dataset: http://www.ncbi.nlm.nih.gov/projects/SNP/. 
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